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ABSTRACT:  Observations  from  inspection  by  a  ’test'  method  and  a  standard 


method  are  combined  to  provide  estimators  of  population  proportion,  and 
of  probabilities  of  misclassification  for  the  test  method.  Results  of 
Hochberg  and  Tenenbein  (1983)  and  of  Albers  and  Veldman  (1984)  are 
extended  to  the  case  where  the  standard  method  is  not  perfect,  but  its 
misclassification  probabilities  have  known  values.  Both  moment  and  maximum 


likelihood  estimators  are  considered  and  some  asymptotic  properties  of  the 

resulting  estimators  are  compared.  ' 
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Key  Words  and  Phrases:  Binomial  distribution;  information  matrix; 

inspection  errors;  maximum  likelihood;  method  of  moments;  statistical 
differentials. 


1.  INTRODUCTION 


Suppose  we  have,  a  large  population,  containing  an  unknown  proportion, 

P,  of  individuals  possessing  a  certain  characteristic,  which  we  will 
call  'nonconformance'.  In  a  random  sample,  of  size  n,  from  this  population, 
the  distribution  of  the  number,  X,  say,  of  nonconforming  individuals 
will  be  binomial  with  parameters  n,P  so  that 

Pr[X=x]  =  PX(l-P)n‘X  (x  =  0,1,. . . ,n) . 

We  will  represent  this,  symbolically,  as 

X  z-' Bin(n,P)  ,  where  denotes  "is  distributed  as". 

If  the  individuals  in  the  sample  of  size  n  are  examined  by  an 
imperfect  measuring  device,  which  detects  actual  nonconformance  with 
probability  p,  and  (incorrectly)  'detects'  nonconformance,  when  the 
individual  is  really  not  nonconforming,  with  probability  p',  then  the 
distribution  of  Z,  the  number  of  individuals  declared  to  be  nonconforming, 
as  a  result  of  this  inspection,  will  be  binomial  with  parameters  n, 

Pp+  (l-P)p'  .  It  is  clear  that  the  only  parameter  that  can  be  estimated 
from  observations  on  values  of  Z  in  independent  samples  is  I^>  +  (l-P)p'  . 

Various  methods  have  been  suggested  for  obtaining  data  from  which 
estimates  of  P,  p  and  p'  can  be  derived  (e.g.  Albers  and  Veldman  (1984), 
Johnson  and  Kotz  (1985)).  Tenenbein  (1970)  suggested  additional  inspection 
of  part  of  the  sample  by  a  perfect  measuring  device  (for  which  p=l  and 
p'=0)  and  utilizing  the  resultant  data.  This  method  has  been  extended  by 
Hochberg  and  Tenenbein  (1983)  to  allow  for  inspection  of  a  further  sample, 
of  size  ng,  say,  by  the  perfect  measuring  device  (S) . 


In  this  paper,  we  study  problems  arising  in  this  latter  situation  if 
the  'established*  measuring  device  S  is  not  perfect,  but  has  known  values 
Ps,  p^  for  p,p*  respectively.  For  convenience,  we  will  denote  the 
(unknown)  values  of  p,p'  for  the  measuring  device  under  test  (T)  by 
PT,p^.  respectively.  We  will  also  assume  (when  necessary)  that  pg  »  pg 
and  pT  »  Pj  . 

2.  ANALYSIS  I  (Moment  Estimation) 

As  a  consequence  of  the  inspections  we  have  the  following  sets  of 
observations : 

(i)  ns  using  S  alone,  with  Zg  judged  nonconforming  (NC) , 

(ii)  Op  using  T  alone,  with  Z^.  judged  NC, 

(iii)  n  using  both  S  and  T,  with  results  shown  below: 


(#  denotes  'number  of'.)  Evidently,  Zll  +  Z10+Z01  +  Z00  n- 

Under  the  assumption  of  random  sampling  from  a  population  of  effectively 
infinite  size,  we  have  that: 
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with  <j>  *  PgP^P  +  PgP^fl-P)  ,  where denotes  "is  distributed  as". 

Recall  that  Pg  and  p£  have  known  values,  and  P  is  the  (unknown) 
proportion  of  NC  individuals  in  the  population. 

Hence 

(ng  +  n)6g  »  Zg  +  Z1Q  +  Z11-^  Bin(ng  +  n,  0g)  (2.1) 

(nj,  +  n)9y  =  Zj  +  ZQ1  +  ^  Bin(nj,  +  n,  9^,)  (2.2) 

n<j>  =  Bin(n,4>)  (2.3) 

«s» 

so  that  9g  0^,  and  $  (as  defined  in  (2.1) -(2. 3))  are  unbiased  estimators 
of  0g,  0T  and  <J>  respectively. 

Hence  P  =  (pg-p£)  "^V1^  C3-1) 

is  an  unbiased  estimator  of  P  .  Although  the  estimators 


PT  "  (Qg'Pg)  Ps  ®T^ 
and  P^.  =  (pg-Qg) 


(3.2) 

(3.3) 


are  not  unbiased  estimators  of  p^  and  p,j,  respectively,  the  biases  should  not 
be  large  if  sample  sizes  are  adequate  (see  the  example  later  in  this  section) . 
The  variance- covariance  matrix  of  the  random  variables  in  (2.1) -(2. 3)  is 


Var  ( (ng+n)  9g ,  (n^+n)  0^,n<f>) 


Hence  (cf.  (3.1)) 


(ng+n)  g(l-®g) 

n(<P"0SeT) 

n<j)(l-0g) 


n(4»-9g9T) 
(xiji+n)  9j  ( 1  -  9y) 

n<j»(l-0T) 


n<J>(l-9g) 

nd>(l-0T) 
nq>  (1~4>) 


(4) 


Var(P)  =  (ng+nr^Pg-py'^gd-Sg) 


(5.1) 


and,  using  the  method  of  statistical  differentials  (see,  e.g.  Johnson  and 
Kotz  (1969,  Chapter  1,  Section  7. 5))  we  obtain,  after  some  algebraic  manipulation: 


var(p^.)=pTV2(Ps-Ps)"2Qn"1<i>C1"^^‘2CnT+n)  1Ps<K1'0t)  +  (lyti)  jpT 

-2(ns+n)"1{<Kl-esVn(nT+n)'1p^(<j>-9seT)}pT'1  +  (ns+n)‘19s(l-0s)J  (5.2 

An  approximate  expression  for  var(pjO  is  obtained  from  (5.2)  by 

replacing  p^.  by  p^.  and  P  by  (1-P) ,  and  interchanging  pg  and  Pg  . 

An  approximate  formula  for  the  bias  of  p^.  is 

•  ' 

.  var(0  )  cov(9_,  $-p’  «L) 

E[Pxl  -  Pt  7  Pt  - S - 5 - ^“L  ■  (*) 

CQg-py  (0s‘psK<|,"ps  V 

which,  after  some  reduction,  gives  a  proportional  bias  (i.e.  100 (bias) /p^.%) 

iootn^q-^)  +  n(l-p^)es}(»-9s9T)  (7) 

(ns+n)  (ry-n)  (0s“Py  ^OPgV  0 

From  (1.1)- (1.4) 

VPS  =  Cps'ps)p ;  *‘ps  9t  =  *r<Vps)p 

and  4>-0s9t  =  P(l-P)  (Pg’Pg)  CPT-pf)  ,  so 

the  approximate  proportional  bias  (7)  is 

100(iy>g(l-9g)  +  n(l-Pg)9g}  (l-P)(Pj-pj)  ^  ^ 

(ng+n)  (ry-n)P2  (ps-py  2pT 

which  is  positive  and  (since  p^  <  p^)  less  than 

100  G(l-P)  0 7  ra\ 


(ng+n)P^(ps-py 


G  =  iyn  PS(1'0S5  +  iy-n  C1‘PS^0S  ’ 
which  lies  between  p£(l-6g)  and  (l-p£)6g  . 

Example.  Using  as  'typical'  values  of  the  probabilities  Pg,  Pg  and  P  the  values 
0.9,  0.1  and  0.1  respectively  we  find  that 

G  =  (nT+n)'1(0.082nT  +  0.162n) 

(so  that  G  lies  between  0.082  and  0.162)  and  the  approximate  proportional  bias 
of  pT  is  between  0  and  1406.25  G (ng+n) ~ 1 I  .  Note  that  the  upper  limit  is 
less  than  227.8  (ng+n)_1%,  so  if  ng  +  n>100  the  approximate  proportional  bias 
is  less  than  2.28%.  The  next  section  contains  a  numerical  assessment  of 
formula  (5. 2) ,  without  specifying  values  of  p^  and  pj. . 


3.  Some  Numerical  Approximations 


Utilizing  the  reasonable  assumption  that  Pg»  Pg  ,  and  neglecting 
? 

terns  in  Pg  and  Pg  in  the  numerator  of  (5.2)  we  find 

„  -  2  n-2,  „-2  ,  0S(1'0S) 

Var(pT)  -  pT  P  (p  -p'1  —5 - + -  ■  (] 

b  5  npT  (ns+n)pT  ng+n 

Taking  Pg  *  0.9,  Pg  =  0.1  so  that  9g  =  0.8P+Q.land  <{>  =  0.9  p^P+0.1  p^.(l-P) 


we  obtain  from  (10) 


Var(pT)  =  P-T--^ 
1  0.64P' 


{0.9?^  +  0.1pj.(l-P)Hl-0.9prP- O.lpJ(l-P)} 


(0.8P+0.1) (0.9-0.8P)  -  {l.8pjP  +  0.2pj(l-P)) (0.9-0. 8P)p^X 


n„  +  n 


Now  taking  P  =  0.1,  we  find 


..  PT2  j"o .  09  (PT+P-p  (1-0.09  (pT+Py) } 


=  (p j+Pj)  { i -  o .  09  (pt+p^)  }  -  — — ^  .  11  (12) 

n  ng+n  pT 

Since  0 . 09 (p^+pj,)  (1-0.09 (p^+Pj) }  <  j  (because  0.09(p^+p^)  <  1)  the  right  hand 
side  of  (6)  is  less  than 

(0.0256n)  1  <  39.1  n'1  . 

In  the  next  section  we  will  compare  the  asymptotic  variances  and  covariances 
of  9g>9>j>  and  P  with  those  for  maximum  likelihood  estimators  0g,  8^.  and  P  of  0g, 

0T  and  P  respectively. 


4.  ANALYSIS  II 


cimum  Likelihood  Estimators) 


The  likelihood  function  of  ZS’  ZT  and  Z  is 


nS  "T  I  11  a  \ 


7  7  7  7  7  7  0S  OM 

^s  4t  hvho,Loi^oo\  a 


ns  Zs  4)Zll(0s-4>)Zl°(eT-<t))  O1(l-0s-0T+<}))  oc 


Equating  derivatives  of  the  log -likelihood  to  zero  gives  the  following  equations 
for  §s,  0T  and  $  : 


1§  .  ns~zs  +  ho _ ho_ 


A  A  A 


(13.1) 


6s  1'0s  es*  1-0  s' V* 


Zj  "  I^-Zj  +  ZQ1  "  Z00  = 

A  A.  A  A  A  A  A 

0T  i-0T  eT-0  i-0s-eT+<() 


(13.2) 


hi  _  zio  _  zoi  +  zoo  =  0 

A  A  A  A  A  A  A  A 

$  9g-<}>  0^,-0  l-0g-6j+<j) 


(13.3) 


The  information  matrix  is 


l-9g-0j+(}> 

n(i-eT) 

(0s-4>)(i-0s-eT+<j>) 


0T(1-0T]  C9t-4>3  C1-0s-0t+4>) 


C0T-<j>)  (1-0S-0T+*) 


n(l-0g)  n{0g0T(l-9g-eT)+20g0T((.-4)  } 

(0T-<j))  (1-0S-0T+(J>)  <t>(0s-<l>)  (0T-<J>)  (i-es-eT+4>) 


The  determinant  is 


ngOr  Y 


<l>C0s-4.)  C0t-4>)  Ci-es-0T+d>D  ^  0g0T(i-0s)(i-0T) 

with  y  =  0s0T(l-0g-0T+^J  -  <K<J>-0g6T)  . 


+  nCn+iym) 


and  from  the  asymptotic  variance- covariance  matrix  V  we  obtain 


|v  I  |0T(i-0T)  ^<t>  9s-<j)  0t-4>  i-0s-eT+(j) 

?  0S(1-0S)  (Yng^  +  fiN^VlY+S) 

where  N  =  ng+n^+n  (=  total  number  of  observations)  and 


0j-<f>  l-9g-6y+<|)  <p  0g-<p 


9g0TCl-0g)(l-9T) 


The  MLE  of  P  is 


P  -  (Ps-py'heg-pp 

The  asymptotic  efficiency  of  P  (see  (3.1))  is  the  same  as  that  of  9g,  which  is 
100(nc+n)(Y  n  ■1  +  5N"1)/(y+6)  % 


Taking  pg=p^.=0.9  ,  pg=p£=0.1=P  and  ng=nj.=n  (=  N)  we  find 
Y=  0.0184680  and  6  =  0.0653573,  so  (17)  becomes 

100(ns+n)  (0.2203  ns_1+  0.7797  N'1) 

=  2(0.2203+  0.2599)  =  96.04%  . 

The  asymptotic  variance  of  the  MLE  $  is 

^  .  1  64>(l-4>)  ^^^^{ngOg^a-eg)  +nT0T"1(l-eT)}  +  <j,(0s-<(,)(0T-(J.)(l-0s-eT+(J.) 

*  n  6+y 


On  the  other  hand,  recalling  that  var(4>)  =  n  (1~4>)  *  we  find  for  the 
numerical  values  of  the  parameters  used  above,  that  the  asymptotic  efficiency 
of  the  moment  estimator  <p  is 

100  x  0-0653573x  0.09(0.91  -  (2/3)  *  0.09  x  (0.18)'1  *  0.82}  +  0.09*  0.00^x0,73 

0.09*  0.91  0.0653573  +  0.0184680) 

=  100  x  0,0037449  +  0.0005322  .  62<30% 

0.0068653 

The  markedly  lower  asymptotic  efficiency  of  $  is  associated  with  the  fact 
that  it  does  not  utilize  the  information  on  values  of  8g  and  0^  which  is 
available  from  the  other  (ng+Op)  observations.  Some  support  for  this  statement 
comes  from  the  asymptotic  efficiency  of  $  if  the  values  of  0g  and  0^,  are  known. 
This  is 

(0o-<lO  (eT-40  (i-9c- eT+<j>) 

100  X  —2 - i - -  (1 

y(1-4>) 

With  the  numerical  values  of  0g,  0^.  and  4>  which  we  have  been  using  above  this 
would  give  an  asymptotic  efficiency  of  only  35.18%. 
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5.  CORRELATIONS 

The  asymptotic  covariances  between  the  MLE's  0g  and  8^,  and  between 
8g  and  are 

cov(§s,0T)  =  N"1  6(4.-6s0t)/(y+6)  (20) 

and  cov(0g,$)  =  N  *  5  ^{nnij,  1  Cl~d>0g  1)  +  l^Qg^/Cy+S)  (21) 

Since  0  £  0g0,p  £  <P  £  0g  £  1>  the  covariances  cannot  be  negative.  It  is 
of  interest  to  compare  (20)  with  die  covariance  between  the  moment  estimators 
0g  and  0^,  which  is 

cov(0s,0T)  =  n(ng+n)  1(nT+n'1(<f>-0s0T)  (22) 

and  (21)  with  the  covariance  between  8g  and  4> ,  which  is 

cov(0g,4>)  =  (ng+n)~^<j>(l-0g)  (23) 

Using  our  illustrative  parameter  values  (pg=pT=0.9;  pg=p^=0.1=P; 
n=ng=Oj.)  we  obtain  the  following  values  for  correlations: 

corr(0g,0T)  =  0.195;  corr(0g,<j>)  =  0.475 
corr(§s,0T)  =  0.021;  corr(0g,<£)  =  0.513 

Note  that  if  n=0  (and  ng,  Up  >  0)  then 

corr(0g,8^)  =  corr(0g,0^)  =  0  . 

Also,  if  ng=np=0  (and  n>  0)  then 

corr(0s,8T)  =  corr(§s,0T)  =  (d>-0s0T) {0seT(l-0g)  (1-0T) }  5 

(=  0.390) 
and 

corr(0g,ij>)  =  corr(8g,$)  =  [<J>(l-es)/{0g(l-d>) }] 2 
(=  0.147) 

(Numbers  in  parentheses  are  values  corresponding  to  pg=pj=0.09,  pg=pj=0.1=P. 
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